Retrieving OCR Text: A Survey of Current Approaches
نویسندگان
چکیده
The importance of effectively retrieving OCR text has grown significantly in recent years. We provide a brief overview of work done to improve the effectiveness of retrieval of OCR text.
منابع مشابه
A Survey of Retrieval Strategies for OCR Text Collections
The importance of effectively retrieving OCR text has grown significantly in recent years. We provide a brief overview of work done to improve the effectiveness of retrieval of OCR text.
متن کاملRetrieving Arabic Printed Document: a Survey
This paper surveys some of the literature pertaining to searching and retrieving OCR’ed printed documents with emphasis on Arabic documents. It examines peculiarities of Arabic morphology, orthography, retrieval, word clustering, display, OCR, and error correction. The paper surveys existing evaluation test-beds for retrieval of Arabic OCR texts. Lastly, it concludes with possible directions fo...
متن کاملA System for Identifying and Exploring Text Repetition in Large Historical Document Corpora
We present a software for retrieving and exploring duplicated text passages in low quality OCR historical text corpora. The system combines NCBI BLAST, a software created for comparing and aligning biological sequences, with the Solr search and indexing engine, providing a web interface to easily query and browse the clusters of duplicated texts. We demonstrate the system on a corpus of scanned...
متن کاملA Survey of Math Accessibility For Blind Persons and An Investigation on Text/Math Separation
Despite recent advances, blind students, researchers, and professionals lack easy access to mathematical resources. This lack of access is a barrier to higher education for many blind students and puts them at an unfair disadvantage in school, academia, and industry. A survey of current mathematical accessibility technologies for blind persons is covered in this paper, encompassing reading, wri...
متن کاملUsing Text Surrounding Method to Enhance Retrieval of Online Images by Google Search Engine
Purpose: the current research aimed to compare the effectiveness of various tags and codes for retrieving images from the Google. Design/methodology: selected images with different characteristics in a registered domain were carefully studied. The exception was that special conceptual features have been apportioned for each group of images separately. In this regard, each group image surr...
متن کامل